Search CORE

10 research outputs found

On the Similarities Between Native, Non-native and Translated Texts

Author: Nisioi Sergiu
Ordan Noam
Rabinovich Ella
Wintner Shuly
Publication venue
Publication date: 01/01/2016
Field of study

We present a computational analysis of three language varieties: native, advanced non-native, and translation. Our goal is to investigate the similarities and differences between non-native language productions and translations, contrasting both with native language. Using a collection of computational methods we establish three main results: (1) the three types of texts are easily distinguishable; (2) non-native language and translations are closer to each other than each of them is to native language; and (3) some of these characteristics depend on the source or native language, while others do not, reflecting, perhaps, unified principles that similarly affect translations and non-native language.Comment: ACL2016, 12 page

arXiv.org e-Print Archive

Crossref

CoCo: A tool for automatically assessing conceptual complexity of texts

Author: Hulpus Ioana
Nisioi Sergiu
Štajner Sanja
Publication venue: European Language Resources Association
Publication date: 01/01/2020
Field of study

Traditional text complexity assessment usually takes into account only syntactic and lexical text complexity. The task of automatic assessment of conceptual text complexity, important for maintaining reader's interest and text adaptation for struggling readers, has only been proposed recently. In this paper, we present CoCo - a tool for automatic assessment of conceptual text complexity, based on using the current state-of-the-art unsupervised approach. We make the code and API freely available for research purposes, and describe the code and the possibility for its personalization and adaptation in details. We compare the current implementation with the state of the art, discussing the influence of the choice of entity linker on the performances of the tool. Finally, we present results obtained on two widely used text simplification corpora, discussing the full potential of the tool

MAnnheim DOCument Server

MDD @ AMI: Vanilla Classifiers for Misogyny Identification

Author: El Abassi Samer
Nisioi Sergiu
Publication venue: 'OpenEdition'
Publication date: 11/05/2021
Field of study

In this report, we present a set of vanilla classifiers that we used to identify misogynous and aggressive texts in Italian social media. Our analysis shows that simple classifiers with little feature engineering have a strong tendency to overfit and yield a strong bias on the test set. Additionally, we investigate the usefulness of function words, pronouns, and shallow-syntactical features to observe whether misogynous or aggressive texts have specific stylistic elements

OpenEdition

EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020

Author: Agerri Rodrigo
Aliprandi Carlo
Alkhalifa Rabab
Alzetta Chiara
Angel Jason
Anselmi Guido
Appiah Balaji Nitin Nikamanth
Aroyehun Segun Taofeek
Artigas Herold Maria Fernanda
Attanasio Giuseppe
Attardi Giuseppe
Badryzlova Yulia
Bai Yang
Baldissin Gioia
Ballarè Silvia
Barrón-Cedeño Alberto
Bartle Anna-Sophie
Basile Pierpaolo
Basile Valerio
Basili Roberto
Belotti Federico
Bennici Mauro
Bharathi B.
Bhuvana J.
Bianchi Federico
Bisconti Elia
Bolanos Luis
Bondielli Alessandro
Bosco Cristina
Breazzano Claudia
Brivio Matteo
Brunato Dominique
Cafagna Michele
Caputo Annalina
Caselli Tommaso
Cassotti Pierluigi
Castañeda Enrique
Castro Castro Daniel
Centeno Roberto
Cercel Dumitru-Clementin
Cerruti Massimo
Chandrabose Aravindan
Chesi Cristiano
Chiarello Filippo
Cignarella Alessandra Teresa
Cimino Andrea
Comandini Gloria
Croce Danilo
Dai Hongbing
Dascalu Mihai
Dell’Orletta Felice
Delmonte Rodolfo
Deng Tao
De Francesco Nazareno
De Martino Graziella
De Mattei Lorenzo
Di Buccio Emanuele
Di Maro Maria
di Nuovo Elisa
Di Rosa Emanuele
dos S.R. da Silva Adriano
Durante Alberto
El Abassi Samer
Espinosa María S.
Fabrizi Samuel
Fantoni Gualtiero
Ferilli Stefano
Ferraccioli Federico
Fersini Elisabetta
Finos Livio
Fiorucci Stefano
Fontana Michele
Frenda Simona
Gambino Giuseppe
Gatt Albert
Gelbukh Alexander
Giorgi Giulia
Giorgioni Simone
Girardi Paolo
Goria Eugenio
Gregori Lorenzo
Hoffmann Julia
Iacono Maria
Iovine Andrea
Izzi Giovanni Luca
Jimenez Sergio
Kaiser Jens
Kayalvizhi S.
Kivlichan Ian
Klaus Svea
Koceva Frosina
Kovács György
Kruschwitz Udo
Labadie Tamayo Roberto
Lai Mirko
Laicher Severin
Lapesa Gabriella
Lavergne Eric
Lebani Gianluca E.
Lebani Gianluca E.
Lees Alyssa
Lenci Alessandro
Leonardelli Elisa
Li Hongling
Liakata Maria
Lovetere Marco
Madonna Domenico
Massidda Riccardo
Mattei Lorenzo De
Mauri Caterina
Mele Francesco
Melucci Massimo
Menini Stefano
Miaschi Alessio
Miliani Martina
Moggio Alessio
Montagnani Matteo
Montefinese Maria
Montemagni Simonetta
Monti Johanna
Moraca Maurizio
Moretti Giovanni
Morra Simone
Murphy Killian
Muti Arianna
Nakov Preslav
Nisioi Sergiu
Nissim Malvina
Nozza Debora
Occhipinti Daniela
Ortega Bueno Reynier
Ou Xiaozhi
Palmonari Matteo
Parizzi Andrea
Pascucci Antonio
Passaro Lucia C.
Pastor Eliana
Patti Viviana
Pirrone Roberto
Polignano Marco
Politi Marcello
Pont Mattia Da
Pražák Ondřej
Proisl Thomas
Puccetti Giovanni
Přibáň Pavel
Radicioni Daniele P.
Rama Ilir
Rambelli Giulia
Ravelli Andrea Amelio
Rodrigo Alvaro
Rodriguez-Diaz Carlos A.
Rodriguez Cisnero Mariano Jason
Roman Norton T.
Roman Norton Trevisan
Rossmann Daniela
Rosso Paolo
Rotaru Armand Stefan
Rubino Edoardo
Russo Irene
Sabella Gianluca
Saini Rajkumar
Salman Samir
Sangati Federico
Sanguinetti Manuela
Sarti Gabriele
Schlechtweg Dominik
Schulte im Walde Sabine
Sciandra Andrea
Setpal Jinen
Siciliani Lucia
Solari Dario
Sorensen Jeffrey
Sorgente Antonio
Sprugnoli Rachele
Stranisci Marco
Tamburini Fabio
Taylor Stephen
Tesei Andrea
Thenmozhi D.
Tonelli Sara
Torre Ilaria
Tsakalidis Adam
Varvara Rossella
Venturi Giulia
Vettigli Giuseppe
Vlad George-Alexandru
Wang Benyou
Zaharia George-Eduard
Zamparelli Roberto
Zubiaga Arkaitz
Publication venue: 'OpenEdition'
Publication date: 11/05/2021
Field of study

Welcome to EVALITA 2020! EVALITA is the evaluation campaign of Natural Language Processing and Speech Tools for Italian. EVALITA is an initiative of the Italian Association for Computational Linguistics (AILC, http://www.ai-lc.it) and it is endorsed by the Italian Association for Artificial Intelligence (AIxIA, http://www.aixia.it) and the Italian Association for Speech Sciences (AISV, http://www.aisv.it)

OpenEdition

CoCo: A tool for automatically assessing conceptual complexity of texts

Author: Hulpus Ioana
Nisioi Sergiu
Štajner Sanja
Publication venue: European Language Resources Association
Publication date: 01/01/2020
Field of study

Identifying Source-Language Dialects in Translation

Author: Ana Sabina Uban
Liviu P. Dinu
Sergiu Nisioi
Publication venue: MDPI AG
Publication date: 01/04/2022
Field of study

In this paper, we aim to explore the degree to which translated texts preserve linguistic features of dialectal varieties. We release a dataset of augmented annotations to the Proceedings of the European Parliament that cover dialectal speaker information, and we analyze different classes of written English covering native varieties from the British Isles. Our analyses aim to discuss the discriminatory features between the different classes and to reveal words whose usage differs between varieties of the same language. We perform classification experiments and show that automatically distinguishing between the dialectal varieties is possible with high accuracy, even after translation, and propose a new explainability method based on embedding alignments in order to reveal specific differences between dialects at the level of the vocabulary

Directory of Open Access Journals

Exploring neural text simplification models

Author: Dinu Liviu P.
Nisioi Sergiu
Ponzetto Simone Paolo
Štajner Sanja
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

We present the first attempt at using sequence to sequence neural networks to model text simplification (TS). Unlike the previously proposed automated TS systems, our neural text simplification (NTS) systems are able to simultaneously perform lexical simplification and content reduction. An extensive human evaluation of the output has shown that NTS systems achieve almost perfect grammaticality and meaning preservation of output sentences and higher level of simplification than the state-of-the-art automated TS systems

Crossref

MAnnheim DOCument Server

Quality Estimation for Machine Translation

Author: Alvin
Angrosh Mandya
Asano Hiroki
Avramidis Eleftherios
Avramidis Eleftherios
Bach Nguyen
Beck Daniel
Beck Daniel
Beck Daniel
Bergen Zachary
Bergstra James
Biçici Ergun
Biçici Ergun
Biçici Ergun
Biçici Ergun
Biçici Ergun
Blain Frédéric
Bojar Ondřej
Buck Christian
Burstein Jill
Burstein Jill
Camargo de Souza José Guilherme
Carolina Scarton
Chase Lin Lawrence
Cohn Trevor
Dahlmeier Daniel
Denkowski Michael
Dong Fei
Dusek Ondřej
Felice Mariano
Findings
Formiga Lluís
Gamon Michael
Gandrabur Simona
Giannakopoulos George
Glavas Goran
Goldwasser Dan
Goodfellow Ian J.
Gustavo
Gustavo Henrique Paetzold
Hardmeier Christian
He Yifan
Hildebrand Silja
Hokamp Chris
Ive Julia
Jones Douglas A.
Kauchak David
Kim Hyun
Kiros Ryan
Koehn Philipp
Koponen Maarit
Krings Hans P.
Lafferty John D.
Lampouras Gerasimos
Landauer Thomas K.
Lavergne Thomas
Lin Chin-Yew
Logacheva Varvara
Logacheva Varvara
Lucia Specia
Mairesse François
Martins André F. T.
Mathias Sandeep
McLaughlin G. Harry
Meurers Detmar
Mikolov Tomas
Mikolov Tomas
Napoles Courtney
Napoles Courtney
Negri Matteo
Ng Hwee Tou
Ng Raymond W. M.
Nisioi Sergiu
Och Franz Josef
Paetzold Gustavo Henrique
Paiva Daniel
Parker Robert
Pedregosa Fabian
Persing Isaac
Popović Maja
Potet Marion
Quirk C. B.
Reiter Ehud
Richardson Matthew
Rikters Matīss
Rubino Raphael
Saggion Horacio
Sakaguchi Keisuke
Scarton Carolina
Scarton Carolina
Scarton Carolina
Servan Christophe
Shah Kashif
Singh Abhishek
Singh Anil Kumar
Snover Matthew
Soricut Radu
Soricut Radu
Soricut Radu
Specia Lucia
Specia Lucia
Specia Lucia
Specia Lucia
Specia Lucia
Specia Lucia
Specia Lucia
Stajner Sanja
Stajner Sanja
Steinberger Josef
Taylor Wilson L.
Turchi Marco
Turchi Marco
Ueffing Nicola
Venugopal Ashish
Wisniewski Guillaume
Wubben Sander
Xiong Deyi
Xu Wei
Zhang Hao
Zwillinger Daniel
Publication venue: 'Morgan & Claypool Publishers LLC'
Publication date
Field of study

Crossref

Automatic Text Simplification

Author: Aluísio Sandra
Aluísio Sandra Maria
Anderson Jonathan
Anula Rebollo Spanish
Anula Rebollo Ángel Alberto
Anula Rebollo Ángel Alberto
Aranzabe María Jesús
Barbu Eduard
Barthe Kathy
Bautista Susana
Belder Jan De
Biran Or
Bott Stefan
Bott Stefan
Bott Stefan
Bott Stefan
Brants orsten
Briscoe Ted
Brooke Julian
Burga Alicia
Burstein Jill
Carroll John
Charniak Eugene
Collins-ompson Kevyn
Colman Andrew M.
Coster William
Crossley Scott A.
Crystal David
Cunningham Hamish
Dale Edgar
Dale Edgar
Dell'Orletta Felice
Dempster A. P.
Devlin Siobhan
Diéguez José Rodríguez
Drndarević Biljana
Drndarević Biljana
Drndarević Biljana
DuBay William H.
Elhadad Noemie
Eskenazi Maxine
Feblowitz Dan
Feng Lijun
Ferrés Daniel
Freyhoff Geert
Gasperin Caroline
Gasperin Caroline
Glavas Goran
Glavas Goran
Gonzalez-Dios Itziar
Gunning Robert
Heilman Michael
Heilman Michael J.
Horacio Saggion
Jauhar Kumar Sujay
Kajiwara Tomoyuki
Kamp Hans
Kauchak David
Keselman A.
Keskisärkkä Robin
Kincaid J. Peter
Krovetz Robert
Lal Partha
Landauer omas K.
Lavelli Alberto
Lin Chin-Yew
Lin Yuri
Marimon Montserrat
Martín-Valdivia Maria Teresa
McLaughlin Harry G.
Medero Julie
Mel'cuk Igor
Mikolov Tomas
Mitchell omas M.
Muñoz Ignacio Bosque
Nisioi Sergiu
Ogden Charles Kay
Padró Lluis
Pianta Emanuele
Quinlan J. Ross
Quirk Randolph
Saggion Horacio
Sarah
Schmid Helmut
Schrijver Alexander
Schuurman
Selected Papers
Shardlow Matthew
Siddharthan Advaith
Siddharthan Advaith
Snover Matthew
Specia Lucia
Stajner S.
Stajner Sanja
Stajner Sanja
Stajner Sanja
Stajner Sanja
Stajner Sanja
Stajner Sanja
Vandeghinste Vincent
Walker Andrew
Williams Sandra
Woodsend Kristian
Wubben Sander
Xu Wei
Yaneva Victoria
Yaneva Victoria
Yatskar Mark
Zajic David
Zeng-Treitler Qing
Zhang Ying
Zhu Zhemin
Publication venue: 'Morgan & Claypool Publishers LLC'
Publication date
Field of study

Crossref